A generating model for Finnish nominal inflection using distributional semantics
نویسندگان
چکیده
Abstract Finnish nouns are characterized by rich inflectional variation, with obligatory marking of case and number, optional possessive suffixes the possibility further cliticization. We present a model for conceptualization inflected nouns, using pre-compiled fasttext embeddings (300-dimensional semantic vectors that approximate words’ meanings). Instead deriving vector an word from another in its paradigm, we propose is conceptualized means summation latent representing meanings lexeme features. tested this on 2,000 most frequent their forms corpus (84 million tokens). Visualization space t-SNE clarified ‘main effects’ additive does not do justice to semantics inflection. In Finnish, how number realized turns out vary substantially case. Further interactions emerged clitics. By taking these into account, accuracy our model, evaluated as gold standard, improved 76% 89%. Analyses errors made 7.5% due overabundance (and hence true errors), 16.5% involved exchanges semantically highly similar stems (lexemes). Our results indicate, first, noun inflection more intricate than assumed thus far, second, intricacies can be captured surprisingly high simple generating based imputed lexemes, features,
منابع مشابه
Nominal Coercion in Space: Mass/Count Nouns and Distributional Semantics
English Theoretical linguists analyse all nouns as either mass or count, but admit that noun meanings can be shifted from one class to the other and classify these shifts. We use distributional semantic models to check how the theoretical analysis of mass-count meaning shifts relates to the actual usage of the nouns. Italiano In linguistica i sostantivi inglesi sono divisi in numerabili e non n...
متن کاملFinnish resources for evaluating language model semantics
Distributional language models have consistently been demonstrated to capture semantic properties of words. However, research into the methods for evaluating the accuracy of the modeled semantics has been limited, particularly for less-resourced languages. This research presents three resources for evaluating the semantic quality of Finnish language distributional models: (1) semantic similarit...
متن کاملA Semantics for Nominal Comparatives
This work adopts the perspective of plural logic and measurement theory in order rst to focus on the microstructure of comparative determiners; and second, to derive the properties of comparative determiners as these are studied in Generalized Quantiier Theory, locus of the most sophisticated semantic analysis of natural language determiners. The work here appears to be the rst to examine compa...
متن کاملA comprehensive model using modified Zeeman model for generating ECG signals
Developing a mathematical model for the artificial generation of electrocardiogram (ECG) signals is a subject that has been widely investigated. One of its uses is for the assessment of diagnostic ECG signal processing devices. So the model should have the capability of producing a wide range of ECG signals, with all the nuances that reflect the sickness to which humans are prone, and this ...
متن کاملGame Semantics in the Nominal Model
We present a model of games based on nominal sequences, which generalise sequences with atoms and a new notion of coabstraction. This gives a new, precise, and compositional mathematical treatment of justification pointers in game semantics.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Mental Lexicon
سال: 2023
ISSN: ['1871-1340', '1871-1375']
DOI: https://doi.org/10.1075/ml.22008.nik